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ABSTRACT 

Aims. We present a homogeneous and complete catalogue of optical groups identified in the purely flux limited (17.5 < Iab ^ 24.0) VIMOS-VLT 
Deep redshift Survey (VVDS). 

Methods. We use mock catalogues extracted from the MILLENNIUM simulation, to correct for potential systematics that might affect the overall 
distribution as well as the individual properties of the identified systems. Simulated samples allow us to forecast the number and properties of 
groups that can be potentially found in a survey with VVDS-like selection functions. We use them to correct for the expected incompleteness and 
also to asses how well galaxy redshifts trace the line-of-sight velocity dispersion of the underlying mass overdensity. In particular, we train on 
these mock catalogues the adopted group-finding technique i.e. the Voronoi-Delaunay Method (VDM). The goal is to fine-tune its free parameters, 
recover in a robust and unbiased way the redshift and velocity dispersion distributions of groups (n(z) and n(a) respectively) and maximize, at the 
same time, the level of completeness and purity of the group catalogue. 

Results. We identify 318 VVDS groups with at least 2 members in the range 0.2 < z < 1.0, among which 144 (/30) with at least 3 (J5) members. 
The sample has an overall completeness of ~ 60% and purity of ~ 50%. Nearly 45% of the groups with at least 3 members are still recovered if we 
run the algorithm with the particular parameter set which maximizes the purity (~ 75%) of the resulting catalogue. We exploit the group sample 
to explore the redshift evolution of the fraction f, of blue galaxies (U — B < 1) in the redshift range 0.2 < z < 1. We find that the fraction of blue 
galaxies is significantly lower in groups than in the global population (i.e. in the whole ensemble of galaxies irrespectively of their environment). 
Both of these quantities increase with redshift, with the fraction of blue galaxies in groups showing a marginally significant steeper increase. We 
also investigate the dependence of f b on group richness: not only we confirm that, at any redshift, the blue fraction decreases in systems with 
increasing richness, but we extend towards fainter luminosities the magnitude range over which this result holds. 
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1. Introduction 

Galaxy groups and clusters are the largest and most mas- 
sive gravitationally bound systems in the universe. Because of 
this, they are very useful cosmological probes. For example, 
the evolution of their abundance or baryon fraction give in- 
sight s into the value of fundamental cosmologica l parameters 
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Ettori et al.1 120031: Ehang et al J 120061: Ettori etafl l2009). 
their mass and luminosity functions fix the amplitude of 



the power spectrum at cluster scales (e.g. iRosati et"aL1 [2002; 
Finoguen ov et alJ l2010h . while their optical mass-to-light ra- 
tio allows to const r ain the matter density parameter Sl m (e.g. 
Girardi et alJ 120001: iMarinoni & Hudson! 120021: ISheldon et all 
2009). Groups and clusters are also ideal laboratories for as- 
trophysical studies. Several interesting physical processes are 
indeed triggered on scales characterized by such extreme den- 
sity conditions. Their analysis is crucial in particular to under- 
stand the effec ts of local env i ronment on ga l axy formation and 
evolution (e.g. IOemlerfll974t IDres sler 198ft iPostman & Gellerf 
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1.1. The detection of galaxy groups and clusters 

A whole arsenal of algorithms allows to identify and reconstruct 
galaxy systems. They range from the very first pion eer methods 
based on visual ide ntification on photometric plates (lAbelll 1958: 
IZwickv et al.ll968h to more recent techniques which exploit var- 
ious physical properties of the systems as a guide for identifica- 
tion. For example, the thermal bremsstrahlung emission from the 
hot intracluster gas trapped inside the cluster gravitational poten- 
tial allows to spot them by means of X-ray band observations. On 
the opposite side of the spectrum, in the centimetre regime, clus- 
ter detection is made possible thanks to t he Su nyaev-Zeldovich 
effect (SZE. ISunvaev & Zeldovichlll972l[T980l) . Indeed, the hot 
intracluster gas, by inverse-compton scattering the photons of 
the Cosmic Microwave Background (CMB), leaves a character- 
istic imprint in the CMB spectrum which can be exploited as a 
useful signature for identification. A cluster potential well can 
also be detected through strong gravitational lensin g or the cos- 
mic shear induced by we a k gravitational lensin g ( Knei b et al 
12003b iGavazzi et al] 120091: ILimousin etafll2009l Richard et al 
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l2010tlLrmousin et al ]l2010tlMorandi et al.ll2010l) . Clusters iden- 
tification can be based also on the properties of the member 
galaxies. It has been observed that cluster cores host typically 
red galaxies, among which there are the brightest cluster galax- 
ies (BCG). Thus, a cluster center can be identified as a R.A.-dec 
concentration of galaxies with typical r ed colours (see for ex- 
ample the Red-Sequence Cluster Survey, Gladd ers" & Yed f2000. 
the first cluster survey based on this method), in some case 
adding als o the constraint of a high luminosity (e .g. the maxBCG 
method, Hansen et al .1120051 iKoester et al. 200J). 

An orthogonal approach, based on geometrical algorithms, 
consists in identifying systems from the 3D spatial distribution 
properties of their members . These algor i thms vary f rom the 
earlier hierarchical method dMaterndl 1 978l iTullvll 1 9801) and the 
widel y used 'friend of friend' (FOF) method dHuchra & Gellerl 
119821). to the more recent 3D adaptive matc hed filter m ethod 
dKepner et all 1 19991) . the 'C4' method dMiller et all 120051) an d 
the Voronoi-Delaunay Method (VDM. lMarinoni et alT l2002). 
Finally, group-finding algorithms have also been developed 
whic h exploit informat i on extracted from p hotometric redshifts 
(e.g. lAdami et al1l2005b iMazure et aDl2007h . 

The availability of several identification protocols is not only 
useful in order to confirm clusters detection by an a-posteriori 
cross-correlation of various independent catalogues, but it is also 
crucial for spotting eventual systematics which might affect in- 
dividual detection techniques. For ex ample, it has been s hown 
by the first joint X-ray/optical survey (Don ahue et al.ll2002h that 
only ~ 20% of the optically selected clusters were also iden- 
tified in X-rays, while ~6Q% of the X-ray clusters were de- 
tected in the optical sample. Understanding the possible selec- 
tion effects hidden behind the different survey strategies is cru- 
cial in order to interpret this small overlap between the two dif- 
ferent cluster catalo gues (see for example iLedlow et al.l l2003: 
iGilbank et al.ll2004l). Moreover using the RASS-SDSS galaxy 
cluster catalogue iPopesso et al.l d2004) show that a distinct class 
of 'X-ray underluminous Abell clusters' does exist, with an X- 
ray luminosity Lx which is one order of magnitude fainter than 
the one expec ted for their mass according to the typical Lx- 
m ass relation IP opesso et al. 2007a). This supports the concern 
of [Donahue et al. (2002) about the possible existence of biases 
in catalogues selected in different wavebands. 

A major challenge we face is to extend cluster searches at 
high redshift. Indeed, most of the methods described above suf- 
fer from major drawbacks when applied in this regime. Both the 
X-ray apparent surface brightness and the gravitational lensing 
cross section of clusters decrease very rapidly with redshift. As 
a consequence, only very massive clusters can be detected at 
high z. On the contrary, the SZE detection efficiency does not 
depend on redshift, but large SZ survey are yet to be completed. 
For what concerns cluster detection using the spatial distribu- 
tion of members, we emphasize the difference between photo- 
metric and spectroscopic galaxy data sets. Several methods have 
been proposed to detect clusters with photometric data, mainly 
exploiting galaxy colours in different bands. On the one side, 
this method has been successfully used both for surveys (see 
for example the abo ve-mentioned Red-Sequence Cluster Survey, 
Glad ders & Ye e 2000) and single detections (e.g. the very recent 
work bv lAndreon et al.l2 009). but on the other hand the selection 
of red galaxies implies the selection of only the older structures, 
where galaxies lived enough time to be affected by the physical 
processes typi cal of the group en vironment (see for example the 
discussion in iGerke et al.ll2.007h . Moreover, the depth required 
in photometric surveys to identify high-z groups and clusters in- 
creases the number of foreground and background galaxies, due 



to the fact that objects surface number density is enhanced by 
the faint flux limit. This essentially limits the effectiveness of 
2D identifications at high-z. The third dimensions is thus imper- 
ative if we want to disentangle in an efficient way projection ef- 
fects. Nonetheless, the uncertainty on the line-of-sight position 
of galaxies may be a concern when it is bigger (or even much 
bigger) than the typical velocity dispersion of group galaxies, as 
in the case of photometric redshifts. 

1.2. This work and the existing groups and clusters samples 

To date, many local, optically selected group catal ogues are 
availab le in literature. A review can be found in lEke et al] 
(I2004I) . where one of the largest catalogue of galaxy groups 
detected in redshift space from the Two Degree Field Galaxy 
Redshift Survey (2dFGRS) is presented. Similarly, several 
group catalogues have been extrac t ed from the Sloan D igital 
Sky Survey data (e.g iMiller et all I2005I: iBerlind et all [2006; 
IWein mann et aD l2006l) . Systematic searches of groups in red- 
shift space have been undertaken also at intermediate redshift 
(e.g. within the CN OC2 survey, up to redshift z = 0.55, 
ICarlberg et al.ll200lt) . The compilation of optically selected and 
complete samples of groups up to z ~ 1 and beyond has 
been made possible only recently thanks to the completion 
of large and deep spectr oscopic surveys, s uch as the DEEP2 
Galaxy Reds hift Survey dPayis et all 120031) . the VIMOS-VLT 
Deep Survey dLe Fevre et al.l l2005l). and the zCOSMOS survey 
(lLillvetal.ll2007U2009h . 

IGerke et al.l (l2005F present the first DEEP2 group catalogue: 
it contains 899 groups with two or more members identified in 
the redshift range 0.7 < z < 1.4 with the VDM method. The 
DEEP2 sample reaches a limiting magnitude of Rab = 24.1, 
and its galaxies are pre-selected in colour before being targeted 
for spectroscopic observations, in order to reduce the number 
of galaxies at z < 0.7. The first zCOSMOS group catalogue 
(Kno bel et ali r2009) comprises ~ 800 groups with at least 2 
members, covering the redshift range 0.1 < z < 1.0. The par- 
ent galaxy sample is purely flux limited (15 < Iab ^ 22.5), and 
groups are detected with the FOF method, combined with the 
VDM. 

In this wor k, we make use of the VIMOS-VLT Deep 
Survey (VVDS. iLe Fevre et al J 120051) to compile an homoge- 
neous optically-selected group catalogue in the redshift range 
(0.2 < z < 1.0). We run the VDM code on a sample contain- 
ing more than 6000 flux limited galaxies (17.5 < Iab ^ 24.0) 
for which reliable spectroscopic redshifts have been measured. 
Particular attention has been devoted to optimally tune the pa- 
rameters of the group-finding algorithm using VVDS-like mock 
catalogues. The selection function of the sample, essentially 
compensating only for the flux limited nature of the survey, is 
simple and mostly insensitive to possibly uncontrolled bias such 
as those which might affect colour selected samples. Moreover, 
the magnitude depth of the VVDS allows us to sample a galaxy 
population which is fainter in luminosity than that currently 
probed by other flux-limited surveys of the deep universe. 

The paper is organized as follows: in ^2]the data sample and 
the mock catalogues are described. The reliability of the virial 
line of sight velocity dispersion as estimated using galaxies is 
discussed in $3] In SI we review the basics of the VDM group- 
finding algorithm, while the strategy followed to fine tune its 
parameters is presented in S|5] In ^6] we describe the properties 
of the VVDS group catalogue. The redshift evolution of the U - 
B colour of group galaxies is analyzed in $7] Conclusions are 
drawn in $8] 
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We frame our analysis in the context of a A Cold Dark 
Matter model (ACDM) specified by the following parameters: 
Q„, = 0.3, £2 A = 0.7, H = 70 km s" 1 Mpc" 1 . Magnitudes are 
expressed in the AB system. 



2. DATA SAMPLE and MOCK CATALOGUES 

2.1. The VVDS-02h sample 

The VIMOS-VLT Deep Survey (VVDS) is a large spectroscopic 
survey whose primary aim is to study galaxy evolution and 
large scale structure for mation. The survey det ailed strategy and 
goals are described in iLe Fevre et al] d2005h . VVDS is com- 
plemented by ancillary deep photometri c data that have been 
collected at the CFHT telescope (BVRI, ILe Fevre et all F2 004: 
McCracken et al.ll2003t) . at the NTT telescope (JK. IIovino et al.l 
120081) and at the MPI telescope (U, 
Also u*, g', r', V, z'-band data are avail- 
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Radovich et al.ll200- 
able as part of the CFHT Legacy Survey. The full suite of spec- 
troscopic and photometric data provides a superb database to ad- 
dress in a wide redshift range many open questions of modern 
observational cosmology. 

In this paper we make use of the data collected in the VVDS- 
0226-04 Deep field (from now on "VVDS-02h field"), where the 
spectroscopic observations have targeted objects in the magni- 
tude range 17.5 < Iab ^ 24.0. In this range the parent photo- 
metric sample is complete and free fro m surface brightness se- 
lection effects (McCrac ken et al.ll2003l) . resulting in a deep and 
purely flux-limited spectroscopic sample. Spectroscopic obser- 
vations (the so-called "first epoch" data) in the VVDS-02h field 
were carried out at the ESO-VLT with the Visible Multi-Object 
Spectrograph (VIMOS), a 4-channel imaging spectrograph, each 
channel (a quadrant) covering ~ 7 x 8 arcmin 2 for a total field 
of view (a pointing) of ~ 218 arcmin 2 . The observations used 1 
arcsecond wide slits and the LRRed grism, covering the spectral 
range 5500A< A <9400A. The resulting effective spectral res- 
olution is R ~227, while the runs accuracy of the redshift mea- 
surements is ~ 275 km/s dLe Fevre et alj 2005). 

The VVDS-02h field covers a total sky area of 0.7 x 0.7 
deg 2 , targeted by 1, 2 or 4 spectrograph passes. This strategy 
produces an uneven target sampling rate as shown in Figure Q] 
The multiple-pass strategy assures that there is no serious under- 
sampling of the denser regions, at least in the ~ 80% of the field 
covered by two or more spectrograph passes. It should be no- 
ticed that some quadrants had to be discarded due to their poor 
quality and not all the regions of the field covered by the same 
number of passes have the same sampling rate. In average, spec- 
tra have been obtained for a total of 22.8% of the parent photo- 
metric catalogue. Due to low signal-to-noise ratio and/or to the 
absence of useful spectral features, only ~ 80% of these targeted 
objects yield a redshift, giving an overall sampling rate of ~ 18% 
(~ 33% considering only the area covered by 4 passes). 



VVDS-02h field first epoch sample probes a comoving vol- 
ume (up to z=1.5) of nearly 1.5 x 10 6 /i~ 3 Mpc 3 in a standard 
ACDM cosmology. This volume has transversal dimensions ~ 
37x37 /r'Mpc at z=1.5 and extends over 3060 /r'Mpc in ra- 
dial direction. 

The collected sample contains 6615 galaxies and AGNs 
with secure redshifts, i.e. redshift determined with a quality 
fl ag=2,3,4,9 (6058 with 0.2 < z < 1.5). We refer the reader 
to ILe Fevre et al.l (|2005) for further details about redshift qual- 
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Fig. 1. Uneven spectrograph coverage in VVDS-02h field. The 
grey-scale from the lightest to the darkest grey indicates the 
sampling rate, with corresponding values shown in the label. 
The grid used for the colour-code had steps of 30" in Right 
Ascension and Declination, and for each grid position we used 
squares of size 7' to estimate the sampling rate. The central area 
covered by 4 VIMOS passes is highlighted by a solid line. 



ity flags. Here we only emphasize that, comparing spectroscopic 
redshifts of objects observed twice by means of independent 
observations, we conclude that redshifts with flag=2(/3/4) are 
correctly estimated with a likelihood of 81(/97/> 99)%. We as- 
signed a flag=9 when in the spectrum there is only a single se- 
cure spectral feature in emission. Given the spectral range cov- 
ered by observations and the flux limits of the survey, this emis- 
sion line is typically [OII]3727A or Ha (in very rare cases 
Lya). Thus flag=9 redshifts have a probability of being cor- 
rect of ~ 50%, being based on the choice between the two 
most probable emission lines. We double-check the robustness 
of the likelihood assigned to flag=2 and flag=9 objects, by con- 
trasting spectroscopic estimates against photometric determi- 
nations. Photome tric redshifts were computed as described in 
Ilber t et al.l (12006). but now using the more recent T0005 release 
of CFHTLS data (u*, g', r', i',z filters) and the latest data avail- 
able from WIRCAM (J, H and K filters, Bielby et al. in prepa- 
ration). According to this comparison, flag=2(/9) redshifts are 
correctly inferred with a likelihood of 78(/59)%, a figure which 
is in good agreement with the independent determination dis- 
cussed above. 

Note, also, that the conclusions of our work are unaffected 
by the fact of including or not in our analysis flag=9 low quality 
redshifts. As a matter of fact these objects constitute a small frac- 
tion (< 3%) of the whole sample. Moreover, the effect of possi- 
ble biases induced by wrong redshift estimates are weakened by 
the very existence of the galaxy correlation on small scales: if a 
galaxy with flag=2 is located nearby (on the sky) to other galax- 
ies with similar (but more secure) redshift, the likelihood that 



4 



O. Cucciati et al.: The VIMOS VLT Deep Survey: the group catalogue 



it shares the same redshift actually increases with respect to the 
probability determined on the basis of our analysis. 

2.2. Mock catalogues 

We made extensive use of mock catalogues, both to test the po- 
tential for group searches of the VVDS-02h field data and to 
tune the parameters of the group-finding algorithm for optimal 
detection. 

Before introducing any particular group-finding algorithm, 
one needs to test which limits in group reconstruction are im- 
posed by the specific characteristics of the VVDS survey design. 
With mock catalogues mimicking VVDS-02h field we were able 
to explore which groups are lost irretrievably due to the sur- 
vey sparse galaxy sampling. Furthermore we were able to assess 
how our measurement of the line of sight velocity dispersion of 
group galaxies is degraded by both the sampling rate and the 
non negligible VVDS redshift measurement error. After having 
explored these limits, we then moved to test and optimize the 
group-finding algorithm, within the ranges in redshift and ve- 
locity dispersion where we found that VVDS-02h data allow a 
reliable group reconstruction. 

Mock catalogues wer e obtained by appl y ing th e semi- 
analytic prescriptions of iDe Lucia & Blaizotl d2007l) to the 
dark matter halo merging trees e xtracted from the Millennium 
SimulatiorPl ( Snrin gel et al.l l2005). The simulation contains N = 
2160 3 particles of mass 8.6 x 1O 8 /i _1 M within a comoving 
box of size 500 /i~'Mpc on a side. The cosmological model 
is a ACDM model with Q m = 0.25, Q. b = 0.045, h = 0.73, 
Qa = 0.75, n — 1 and cr 8 = 0.9. The positions and velocities 
of all simulated particles were stored at 63 snapshots, spaced ap- 
proximately logarithmically from z = 20 to the present day. Dark 
matter halos are identified using a standard friends-of-friends 
(FOF) algorithm with a linking length of 0.2 in units of the mean 
particle separation. 

In this simulation, group galaxies are those in the same FOF 
halo, identified with a unique ID. For each simulated group a 
wealth of physical information are available: galaxy member- 
ship, virial mass (computed directly using the simulated parti- 
cles), virial radius and virial velocity dispersion (both computed 
from the virial mass, through scaling laws and the virial theo- 
rem). The virial mass is computed within the radius where the 
halo has an overdensity 200 times the critical density of the sim- 
ulation. 

It is worth noticing that the model used to construct light- 
cones from the MILLENNIUM simulation has been shown to 
be quite successful in reproducing several basic properties of 
our real data set . The most importa nt are the average redshift 
distribution n(z) (M eneux et alj200"8l) and the global Luminosity 
Function LF (Zucca et al., in preparation), that are in good agree- 
ment with the real VVDS-02h n(z) and LF, with the only excep- 
tion of a slight excess of galaxies in the n{z) mock samples for 
z < 0.5. It should be noticed that such a small difference in n{z) 
does not affect the completeness and purity values (see Section 
15. Il l of our group catalogue, as we specifically tested using sep- 
arately the mocks wi th the most similar a nd the most differ- 
ent n(z). Moreover, in iMeneux et alJ J2008) it is shown that the 
galaxy clustering in the MILLENNIUM simulation light cones 
is consistent with the one measured using the VVDS-02h sam- 
ple. 

Through the Database bui l t for t he Millennium Simulation 
dLemson & Virgo Consortium! 120061) . we selected 10 (1 x 1) 
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deg 2 independent MILLENNIUM li ght cones (generated with 
the code MoMaF, Blaizot et al. 2005), from which we extracted 
several kinds of mocks, according to our purposes. First of all, 
we extracted (lxl) deg 2 flux limited samples, with the same 
flux limits as VVDS-02h sample (17.5 < I AB < 24). These cat- 
alogues have 100% sampling rate, and no redshift measurement 
error added. We called these catalogues M(100, 0), the first num- 
ber in brackets indicating the sampling rate and the second the 
redshift error. Then we randomly depopulated these catalogues 
to obtain subsets with 33%, 17% and 10% sampling rate, mim- 
icking roughly the sampling rate of the 4 passes, 2 passes and 1 
pass areas of the VVDS-02h field. These catalogues are called 
M(33,0), M(17,0) and M(10,0) respectively. With these mock 
catalogues and taking advantage of the known group member- 
ship we were able to assess how much a group catalogue is de- 
populated when sampling rate is lowered to values typical of 
those of VVDS-02h field. 

As a further step, we added redshift measurement errors to 
the 33% sampling rate mocks, randomly chosen from a Gaussian 
distribution centered on with <x = 275 km/s. This way we take 
into account the mean redshift measurement error of our real 
data. We called these mock catalogues M(33,275). With these 
mock catalogues, we were able to test how well we can deter- 
mine group virial velocity dispersion when the survey has flux 
limits, sampling rate and redshift measurement errors mimicking 
those of the 4 passes areas of the VVDS-02h field. 

As a last step we needed mock catalogues to test how ef- 
fective is the group-finding algorithm we adopted in identify- 
ing those groups surviving in a sample like the VVDS-02h one. 
To test the efficiency of our algorithm we used 20 "VVDS- 
like" mocks extracted from MILLENNIUM simulation. These 
mocks have the same flux limits, geometry, uneven sampling 
rate, redshift erro r m easurement as the V VDS-02h sample (see 
iPollo et all 120051 and IMeneux et al.ll2008l for the preparation of 
these mocks). Subtler effects, like those introduced by slit po- 
sitioning bias, have also been included, as the same slit posi- 
tioning tool used for VVDS-02h sample has been used, with the 
same optimization criteria, to generate the VVDS-like mocks. 
Moreover, the areas masked in the real photometric catalogue 
because of bright stars and because of a beam of scattered light 
have also been masked in the VVDS-like mocks. 

For the sake of clarity, we emphasize that whenever we 
refer to the 'FOF' or 'simulated' groups in all the above- 
mentioned mock catalogues, we mean the sets of galaxies within 
the same original FOF halo provided by the simulation itself, 
before any depopulating process: we never ran any FOF algo- 
rithm on mocks after extracting M( 100,0), M(33,0), M(17,0) 
and M(10,0), M(33,275) and "VVDS-like" mocks from simu- 
lations. 



3. Preliminary tests 

3.1. Testing the effects of VVDS survey strategy on groups 

In this section we explore how well the group catalogue ex- 
tracted from a VVDS-like survey trace the group population of 
an ideal survey which is purely flux-limited. In a real flux lim- 
ited galaxy survey with a sampling rate lower than 100%, most 
groups will have a smaller number of members and some will 
even go undetected. We want to assess the fraction of groups that 
"survive" as such (i.e. with at least 2 members) in a survey with 
a sampling rate like the one in VVDS-02h. To identify groups, 
both in the full flux limited and in the various 'observed' cata- 
logues, we used at this phase the identification number of FOF 
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groups in the Millennium database. In other words, we consider 
only the limitations introduced by the survey strategy, neglect- 
ing for the moment further complications introduced by the in- 
completeness/failures of the specific group finding algorithm we 
used. 

In Figure|2]we plot the fraction of groups in mock catalogues 
flux limited at 17.5 < Iab ^ 24.0 that survived after apply- 
ing a sampling rate corresponding to 1/2/4 passes regions (i.e. 
10%, 17% and 33% respectively) as indicated by different lines. 
Practically, we plot the ratio between the number of groups in 
M(10,0), M(17,0) and M(33,0) catalogues and the number of 
groups in M(100, 0) catalogues. This ratio has been computed in 
not-independent running redshift bins of Az = 0.3: continuous 
lines are fits along all the bins, while for reference the ratios cor- 
responding to the M(33,0) catalogues are also shown for each 
redshift bin as red diamonds. Note that the number of groups 
with cr vir > 650 km/s is quite low, mainly because of the small 
field of view, thus the fraction of survived groups at z < 0.8 
fluctuates around a mean value that we use to fit a straight line. 
These fluctuations, however, only in the worst cases are as high 
as 10%. This is also true for M(10,0) and M(17,0) catalogues, 
for which we do not plot single points not to crowd the figure. 
The horizontal dashed line at a fraction value equal to 50% is 
shown for reference. The three panels correspond to different 
cuts in the virial line of sight velocity dispersion (cr v ,,>) quoted 
in the mocks, as indicated by the label (from now onwards all 
velocity dispersions quoted will be line of sight velocity disper- 
sions). 

Figure |2 shows that in 2 and 4 passes areas we can recover 
the majority (> 50%) of groups down to cr V!r ~ 350 km/s in 
the full redshift range below z = 1.0. Obviously going to higher 
values for cr vir allows to extend further the redshift range. Such 
lower limit for <r v / r is in agreement with the one imposed by the 
non negligible redshift measurement error of VVDS survey. As 
we will see in the next paragraph, for groups with cr vir < 350 
km/s our measurements of velocity dispersion are quite unreli- 
able. 



3.2. Estimating group virial l.o.s. velocity dispersion 

A robust determination of the line-of-sight velocity dispersion of 
galaxies in group is essential if we are to infer the group mass in 
a reliable way. When group members are sparsely sampled, as it 
is the case fo r VVDS-02h data, the "gapper method", originally 
suggested bv lBeers et al.1 (Il990t) . has prove d to be the most ro - 
bust velocity dispersion estimator (see also lGirardi et al.|[l993l) . 
This method measures velocity dispersion exploiting the veloc- 
ity gaps in the given velocity distribution of galaxies, using the 
following formula: 



N(N - 1) 



£ i(N - i)(v M 



v,) 



(1) 



where the line of s i ght ve locities v, are sorted into ascend- 
ing order. iBeers et all y990) show in their Table II that this 
method reliably estimates the velocity dispersion with an effi- 
ciency > 90% for groups with ~ 5 - 10 elements, thanks to its 
robustness in recovering the dispersion of a distribution also in 
the more general case of a contaminated Gaussian distribution. 
It is important to emphasize that this range of group members 
is well suited for the study we present in this work. On the one 
hand, we consider the velocity dispersion reliably measurable 
only for groups with at least 5 members, and on the other hand 
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Fig. 2. The fraction, as a function of redshift, of "surviving" 
groups when the sampling rate is decreased from a purely flux 
limited simulated sample 17.5 < I A b < 24.0 (i.e. with 100% 
sampling rate) to ~33% (4 passes area, red line), ~17% (2 
passes, blue line) and ~10% (1 pass area, green line). The frac- 
tion has been computed in not-independent running redshift bins 
of Az = 0.3: continuous lines are fits along all the bins, while for 
reference the fractions corresponding to the M(33, 0) catalogues 
are also shown for each redshift bin as red diamonds. Different 
panels show different cuts in <t,,,>, as quoted in each panel. The 
horizontal dashed line at a fraction value equal to 50% is for 
reference. 

the large majority of groups surviving in "VVDS-like" mocks 
have < 10 members. 

Hereafter, when discussing "measured" velocity dispersions 
(o~meas) we will refer to velocity dispersions obtained apply- 
ing the gapper method to the members of the given group. Of 
course, we corrected this velocity dispersion taking into account 
the scaling between redshift and velocity. Thus we used: 



1+z 



(2) 



where z is the redshift of the group. 

In this section we want to test whether our measurement 
of the line of sight velocity dispersion cr meas is a reliable esti- 
mate of the virial velocity dispersion cr V! > (as listed in the mock 
catalogues). For this comparison we used M(100, 0), M(33,0) 
and M(33,275). We called the <j meas of these three kinds of 
catalogues <x(100, 0), <x(33,0) and <x(33,275) respectively. In 
the case of a non-zero redshift measurement error, such as in 
M(33,275) mock catalogues, we took the error itself into ac- 
count when computing cr meas , so that the error (v err ) was sub- 
tracted in quadrature as follows: 



o-(33,275) 2 



max[Q, <x(33, 275)^ -v 2 err ] 
(1+z) 2 



(3) 



where <x(33,275)g is the velocity dispersion measured in 
M(33,275) mocks with Equation Q] v err = 275 km/s and z is 
the redshift of the group. When <x(33,275) = we considered 
the velocity dispersion not measurable given the redshift error. 
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Fig.3. Comparison of line of sight velocity dispersion cr meas computed in M(100,0), M(33,0) and M(33,275) mocks (I s ', 2 nd and 
3 rd column respectively) with the virial velocity dispersion cr vl >. The first row is for the redshift bin 0.2 < z < 0.6, and the second 
for 0.6 < z < 1 .0. In each plot, the upper panel shows isodensity contours in the plane cr meas versus cr w > for groups with at least 
5 members. Blue triangles are the median (on x axis) and mean (on y axis) values of single points grouped in bins of <r V! >, with 
vertical error bars being the rms of mean values. The bins on the x axis have the following limits: 0.5 - 150 - 250 - 350 - 450 - 550 
- 1 100 km/s. Purple squares are the same as triangles but for groups with at least 8 members. The lower panel in each plot shows 
the percentage error (with its rms) when we compare cr meas and cr V!r . Triangles and squares are plotted only when the corresponding 
<T V i r bin contains at least 15 objects. See text for further details. 



Figure [3] shows the comparison of cr V! > with cr(100, 0), 
<x(33,0) and cr(33,275), respectively in the first, second and 
third column. The first row is for the redshift bin 0.2 < z < 0.6, 
and the second for 0.6 < z < 1.0. In each plot, the upper panel 
shows isodensity contours in the plane cr meas versus <j V i r for 
groups with at least 5 members. Blue triangles are the median 
(on x axis) and mean (on y axis) values of single points grouped 
in bins of cr V! >, with vertical error bars being the rms of mean val- 
ues. As a reference, purple squares are the same as triangles but 
for groups with at least 8 members. The lower panel in each plot 
shows the systematic offset of the relation in the upper panel; the 
offset is expressed as a percentage error (with its rms) computed 
as follows: 



Cr vir 

where cr meas is <x(100, 0), <x(33, 0) and cr(33, 275) in the three 
columns respectively. Symbols have the same meaning as in the 
upper panel. 

Results graphically shown in Fig. [3] can be summarized as 
follows: 



1) Effects due to the WDS-02h flux limit. The plots in the first 
column show that even in the ideal case of purely flux lim- 
ited mock catalogues with 100% sampling rate and without 
redshift measurement error, the measured velocity dispersion 
cr(100, 0) systematically underestimates cr v , r . This system- 
atic offset, shown in the lower part of the plots, is always 
below 20%, with a smaller scatter for increasing cr v ,> and for 
the lower redshift bin. Such an offset can easily be under- 
stood by noticing that in a flux limited survey, even with a 
100% sampling rate, moving to higher redshifts groups will 
progressively lose the fainter members that lie outside the 
selected flux range. As a consequence the measured velocity 
dispersion underestimates the real virial velocity dispersion, 
as the "surviving" galaxies are the brighter, usually found in 
group cores. 

2) Effects due to the lower sampling rate introduced by WDS- 
02h strategy. The plots in the second column show that if we 
decrease the sampling rate from 100% to 33%, our ability 
in recovering cr vir decreases as well, as expected. The sys- 
tematic offset is not significantly worse than in mocks with 
100% sampling rate, but the scatter around the systematic 
offset is larger, especially for low cr vir . 
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3) Effects due to VVDS redshift measurement error. Finally, the 
plots of the third column illustrate the fact that when we add 
275 km/s of redshift error, low <t,, (> are very difficult to re- 
cover, while, for cr,,„. > 350 km/s, the systematic offset and 
its rms remain below 25% with a slightly higher scatter for 
the higher redshift bin. 

Figure [3] convincingly demonstrates that the estimate of the 
velocity dispersion is robust only in groups with cr V!r > 350 
km/s. It also serves the following purpose: it forecasts the preci- 
sion with which the measured velocity dispersion traces a spe- 
cific o~ v i r of the matter particles in the halo once the VVDS sam- 
pling rate and spectroscopic uncertainties are taken into account. 
As a matter of fact, when we analyze the real VVDS-02h group 
catalogue, only the estimate o- meas are available, and nothing is 
known about a" VJ >. We should therefore ask also the reverse ques- 
tion of how far a given value of cr vir is from the observed o~ meas . 
This means that we have to take cr meas as reference when we 
compute the percentage error. We show the results of this anal- 
ysis in Figure |4] in which, for any given bin of cr(33,275) we 
plotted the mean systematic offset from the real underlying cr v , r , 
computed as a percentage error on cr(33, 275) as follows: 



[cr(33,275)-cr W) .] 
o-(33,275) 



x 100. 



(5) 



Also in this case, when considering <x(33,275) greater than 
350 km/s we are able to recover cr v ,> with an error of < 30% 
and < 20% for groups with N > 5 and > 8 respectively. 
Therefore, for velocity dispersions above 350 km/s and despite 
the relatively large error in redshift measurements causes a sys- 
tematic increase in the estimated <x(33, 275), we can still recon- 
struct a sensible value of o- vir in the VVDS-2h 4 passes region. 
We carried on a similar check for groups in 1 pass and 2 passes 
regions, and verified that results are qualitatively similar to those 
obtained in the 4 passes area. 

Globally, our analysis suggests that we can use VVDS-02h 
data as a suitable sample for extracting high-z groups. 

4. The group-finding algorithm 

Several geometrical algorithms have been proposed to identify 
groups and clusters from the 3-dimensional distribution of galax- 
ies, that is by optically identifying them within spectroscopic 
redshift survey (see Section[TJ. 

In this work we identified groups using the Voronoi- 
Delaunay Method dMarinoni et al.1 120021) . In short, the VDM 
combines information from the three-dimensional Voronoi di- 
agram an d its dual, the Delaunay triangulation. The Voronoi 
diagram (IVoronoii ri908) is a polyhedral partition of 3D space, 
each polyhedron surrounding a galaxy and defining the unique 
volume containing all the points that are closer to that galaxy 
than to any othe r galaxy in the sample. The Delaunay complex 
dPelaunavll 19341) also contains proximity information. It is de- 
fined by the tetrahedra whose vertices are sets of four galaxies, 
that have the property that the unique sphere that circumscribe 
them does not contain any other galaxy. The center of the sphere 
is a vertex of a Voronoi polyhedron, and each face of a Voronoi 
polyhedron is the bisector plane of one of the segments that link 
galaxies according to the Delaunay complex. 

The basics of the VDM code is as follows. The denser the 
environment in which a galaxy live, the smaller the Voronoi vol- 
ume which is associated to it. Therefore the Voronoi partition 
allows for an immediate identification of the central regions of 



1000 



800 



.2 600 



0.20<z<0.60 




* 5 
■NJ 8 



200 400 600 800 1000 

sigma(33%,275) 



1000 



800 



.2 600 



100 
50 


-50 
-100 



0.60<z<1.00 





200 400 600 800 1000 
sigma(33%,275) 



Fig. 4. As the plots in the last column of Figure [3] but with 
exchanged axes. In this case the binning is made according to 
cr(33, 275) and not cr,,,y. 



structures. Complementarity, the Delaunay triangulation assigns 
galaxy members to the identified core. Note that a crucial dif- 
ference between the VDM and other methods is that, since it 
preliminarily identifies group centers, group membership recon- 
struction proceed radially outward, from the densest cores to- 
wards the outskirts of the structures. 

An advantage offered by the Voronoi-Delaunay method is 
that it exploits the natural clustering of the galaxies in the sam- 
ple. For example the dimension of the volume assigned to each 
galaxy locally depends on the number density of the objects sur- 
rounding the galaxy itself. It is thus adaptively and unparamet- 
rically rescaled and not predefined on the basis of some fixed 
length parameter. Moreover, galaxies that are Delaunay con- 
nected to the central cores are processed with cylindrical win- 
dows whose dimensions are locally scaled on the basis of physi- 
cal relations observed in simulated (and real) samples of groups 
and clusters. The specific set of VDM parameters is thus de- 
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signed in such a way to offer the maximum of flexibility in se- 
lecting groups according to the a-priori physical information we 
have on their structure. As a consequence, a fine-tuned VDM al- 
gorithm has been proven to be very efficient in reconstructing 
intrinsic characteristics of groups, such as for e xample the line 
of sig ht velocity dispersion of their members ( Ma rinoni et al.l 
120021) . 

The Voronoi-Delaunay method was specifically designed to 
avoid some known drawbacks characterizing standard group- 
finding algorithms such as for example the FOF and the hierar- 
chical methods. These methods are based on user-specified pa- 
rameters (the FOF linking length, the 'affinity' threshold in the 
hierarchical method) that do not depend on the real distribution 
of galaxies. One of the negative consequences is that spatially 
closed but unrelated structures are often merged together in a 
single system. Moreover, some dynamical properties of clusters 
are very sensitive to the adopted group-finding algorithm: for 
example, the velocity dispersion of groups identified by the FOF 
algorithm is found to be systematically higher (by nearly 30%) 
than that of groups found by the hierarchical algorithm, even 
when both algo rithms are optimized on the same galaxy sample 
dGiuricinetal.1120011) . 

We will now briefly summarize how the V DM works, al- 
though detailed de scriptions can be found in iMarinoni et alJ 
(2002) and also in lGerke etail d2005l) (from which we adopted 
some technical improvements). 

At first, the algorithm compu tes the Voronoi-D elaun ay mesh 
follow ing the prescriptions in Ba rber et"a"D d 19961) and iMirtichl 
(1996). It then searches for groups with a 3-step procedure. At 
each step, new group members are identified by means of a cylin- 
drical window (of radius R and half-length L) which is used to 
scan Delaunay connected galaxies and to decide whether or not 
they are cluster members. Phase I concerns the 3-D identification 
of group seeds. In Phase II the algorithm determines group cen- 
tral richness, and finally in Phase III an adaptive scaling based 
on N-cr relation is used to rescale cylinder dimensions depend- 
ing on group richness found in Phase II. A detailed explanation 
of each of these three steps follows in sections |4~T1 14. 2l and l4. 31 

The radius and the half-length of the cylinders of Phase I (??/ 
and X/) and of Phase II CRu and X//), together with r and I, the 
scaling factor used to determine respectively the radius and the 
half-length of the cylinder of Phase III {'Rui and X///), are free 
parameters of the algorithm. They have to be optimized using 
physical information about clusters. 

The choice of a cylindrical shape for the search window is 
physically motivated by the fact that the gravitational field of 
galaxy overdensities induces peculiar velocities whose effect is 
to make the galaxy distribution to appear elongated in the red- 
shift direction. The only way to take this into account is by us- 
ing a search window with a radial extension much longer than 
the transversal dimension, in order not to miss group members. 
N ote that we use a cylin drical window also in Phase I, while 
in Marino ni et al.l J2002) Phase I search window had a spheri- 
cal shape. The original choice of a spherical window for the first 
phase was physically motivated by the fact that galaxies resid- 
ing in the highest density peaks, i.e. the central cores of groups 
and clusters, are expected to have smaller peculiar velocities. 
However, we verified that for less rich systems, i.e. loose groups 
as those we expect to recover in the VVDS sample, the best 
choice is a cylindrical window. In particular, the survey quite 
large redshift measurement error together with the sparse sam- 
pling rate motivate our new choice. 

As we want the length of search cylinders to correspond 
roughly to the peculiar velocity of the galaxies in the group, 



we have to consider that the mapping between redshift interval 
and peculiar veloc ity changes with redshift, and thus, following 
iGerke et afl d2005l) . our algorithm automatically rescales cylin- 
der lengths X(z) as a function of z, using the equation: 

X(z) = [s(z)/s(z )]-C(z ), (6) 
where zo is a reference redshift (see Section l4~2l for details) 

and 



Vo M (i+z) 3 + n A ' 

This scaling as a function of redshift is applied to all X/, X// 
and tin- 

4.1. Phase I. 

In Phase I galaxies are at first ranked according to the increas- 
ing size of their Voronoi volume. A cylinder with radius %i 
and half length X/ is then centered on the galaxy with smallest 
Voronoi volume. All galaxies inside the cylinder and Delaunay- 
connected with the central galaxy are considered group mem- 
bers and called first-order Delaunay neighbours. The central 
galaxy and its first-order Delaunay neighbours are considered 
a group seed. In the case there are no other galaxies in the cylin- 
der, the central galaxy is rejected as potential seed. Thus, the 
choice of Kj and X/ determines the final number of identified 
groups. At the end of this Phase the barycenter of the seed is 
computed, using the positions of the central galaxy and its first- 
order Delaunay neighbours. 

Then the algorithm processes the full sequence of Phases for 
the found seed. After Phase II and Phase III are completed, the 
whole procedure is reiterated by selecting from the sorted list the 
first galaxy not yet assigned to a group. 

4.2. Phase II. 

In the second phase a different cylindrical window with radius 
fin and half length X// is centered on the barycenter determined 
in Phase I, and it is used to determine the central richness of the 
group. All galaxies that fall in the Phase II cylinder and are con- 
nected to the first-order Delaunay neighbours are called second- 
order Delaunay neighbours, and are considered further group 
members. The total number of group members after this phase 
(the central galaxy plus first- and second-order neighbours) is 
considered as the central richness Nu of the group. 

A reliable estimate of Nu is important as it controls the adap- 
tive search window used in Phase III (see below). From one 
hand, the fact of considering only Delaunay-connected galaxies 
minimizes the inclusion of interlopers in Nu- On the other hand, 
in a flux limited survey like VVDS, the Nu distribution varies as 
a function of redshift, because of the variation of the luminosity 
limit with redshift. To ensure a uniform group population, Nu 
has to be corrected as a function of z: 

N% rr (z) = Nu^l (8) 
<v(z)> 

Here zo is the redshift zero-point considered as reference, 
and (v(z)) is the comoving number density, that we calculated 
by smoothing the redshift distribution of the galaxy sample, and 
then dividing it by the diffe rential comoving v olume element at 
the considered redshift. In IGerke et alJ ([2005), Zo is the lower 
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limit of the DEEP2 galaxy redshift distribution n(z), i.e. zo — 0.7. 
In the case of VVDS-02h sample, the lower limit in n(z) is z — 
0.2, but at this redshift the volume covered by the VVDS-02h is 
small. Because of this, (v(z = 0.2)) can be poorly constrained. 
Moreover, (v{z)) decreases very rapidly from z = 0.2 up to z — 

1.0. Thus we chose zo = 0.7 as a compromise between high 
statistics (it is roughly the peak of our n(z) distribution) and not 
yet so large survey volume. 

At the end of Phase II the barycenter position is readjusted 
using all Nu members. 

4.3. Phase III. 

Finally, in Phase III the algorithm reconstructs the full set of 
group members, using a new search window which is centered 
on the group barycenter determined at the end of Phase II and 
with dimensions determined according to the following basic 
scaling relations. 

Assuming that groups are singular isothermal spheres, at 
any given distance r from the center the mass density distribu- 
tion is related to th e velocity dispersion throu gh the equation 
p(r) = (r 2 /(2nGr 2 ) dBinnev & Tremainelll988l) . Since M(r) = 
4nr 3 p(r)/3, and by defining r V! > as the radius of a spherical vol- 
ume within which the mean density is A c times the critical den- 
sity at the considered redshift, we find that M V! > oc cr 3 , where 
M V i r = M(r V i r ) is the virial mass. The virial theorem implies 
that M V i r oc cr 3 oc R 3 . therefore, exploiting the correlation be- 
tween velocity dispersion and central richness, which has been 
confirme d from loose g roups up to massive clusters (for exam- 
ple, see Bahcail ll98ll we end up with the following chain of 
relations N c l ° rr oc M V! > oc cr 3 oc P 3 . 

Accordingly, we let the central richness Nj° rr of each group 
to control both the radius and the length of the cylindrical search 
window: 

- <R IU = r(^7 r ) 1/3 ; 

- Liu = l(N™ rr ) 1 ' 3 . 

Here, r and I are normalization parameters to be optimized 
using simulations. Note that the adaptive search window of 
Phase III will differ from group to group and that all galaxies en- 
closed within the cylinder are considered as further group mem- 
bers, irrespectively of the order of their Delaunay connections. 
From now on, we will call richness N the final number of mem- 
bers assigned to each group at the end of Phase III. 

5. Optimizing the group-finding algorithm 

5.1. Success criteria 

In this section we detail the optimization strategy devised to re- 
construct groups in the most reliable and unbiased way. To this 
purpose, we used VVDS-like mock catalogues. We applied the 
VDM algorithm to them, and compared the groups found by 
the algorithm with the groups present in the mocks identified by 
the same FOF identification number (see Subsection l2.2l ). From 
now on, we will refer to the FOF groups in the mocks as "fidu- 
cial" groups, while groups reconstructed by our algorithm will 
be called "reconstructed" groups, or simply "VDM" groups. 

There are two levels of success we are interested in: 1) suc- 
cess in finding groups, i.e. to establish the level of contamination 
by interlopers and fake groups, the percentage of missed galax- 
ies and missed groups and other statistics of this kind; 2) success 
in reproducing group properties, i.e. to accurately measure group 



properties on a group-by-group basis, and also to reproduce their 
statistical distribution as accurately as possible. 

To test VDM algorithm success in finding the fiducial groups 
present in the VVD S-like mocks, we used the following quality 
estimators (see also iMarinoni et af 1 120021 and iGerke et afl [2005 
for more details): 

- galaxy success rate S ga i'. fraction of galaxies belonging to 
fiducial groups that are identified members of reconstructed 
groups; 

- interlopers fraction ff. fraction of galaxies identified by the 
algorithm as members of reconstructed groups but that are 
interlopers; 

- completeness C: fraction of fiducial groups that are "success- 
fully" identified in the reconstructed catalogue; 

- purity P: fraction of reconstructed groups that "correspond" 
to fiducial groups. 

Hence, we now need a quantitative measure to determine 
whether a fiducial group is detected "successfully" and whether 
a reconstructed group "corresponds" to a fiducial one. To this 
purpose, we consider a detection to be successful when more 
than half of a fiducial group members are detected in the same 
VDM group. On the contrary, a VDM group corresponds to a 
fiducial one when more than half of its members belongs to that 
fiducial group. In general, these two conditions can be verified 
independently. These general cases are called one-way matches 
from one group catalogue to the other (from fiducial to VDM or 
in the opposite direction). But when these conditions are verified 
simultaneously involving the same fiducial and VDM group in 
both directions, we have a two-ways match. So we can have a 
one-way completeness (C\) and a one-way purity (Pi) when we 
consider only one-way matches in the fiducial and in the recon- 
structed group catalogue respectively, but also a two-way com- 
pleteness (C2) and a two-way purity (P2) can be defined when 
considering two- ways matches. 

On the one hand, knowing absolute value of completeness 
and purity will help us in optimizing the algorithm, but on the 
other hand comparing C\ with C2 and Pi with P2 we can es- 
tablish the kind of errors in the reconstructed group catalogue. 
In fact, when C\ » C2, it means that some fiducial groups 
are one-way successes but not two-ways matches, and thus these 
fiducial groups contain a low fraction of the members of their 
reconstructed associated group. This is an indication that VDM 
algorithm tends to overmerge separated groups in bigger recon- 
structed groups, or to assign to reconstructed groups too many 
interlopers. On the other hand, when Pi >> P2 we know that 
VDM algorithm is doing the opposite error, i.e. the reconstructed 
group catalogue is highly fragmented with respect to the fiducial 
one. 

We decided to use these indicators to search for the best set 
of parameter for our algorithm following some guide lines. The 
basic idea is to obtain C\ and C2 as high as possible, while keep- 
ing Pi and P2 at least above 50%. Moreover, we would like not 
to produce a highly overmerged {C\ » C2) or a highly frag- 
mented (Pi >> P2) catalogue, and therefore we tried to obtain 
Ci ~C 2 andP! *P 2 . 

5.2. Algorithm optimization 

We applied the VDM algorithm to 20 VVDS-like mocks, obtain- 
ing group catalogues for the full redshift range 0.2 < z < 1 .5, but 
for the reasons discussed in Section lXTl we implemented the op- 
timization strategy only in the range 0.2 < z < 1 .0. 
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With a trial and error approach we explored the flexibility of 
the 6 VDM parameters in recovering groups in a robust way. We 
allowed each parameter to vary in a wide range. In particular, 1) 
we let %i and Hn increase up to 1 /t'Mpc, with no lower limit: 
this because we wanted the radii to span projec ted dimension s 
up to typical central radius of massive clusters dBahcaUlll98lb . 
2) We let r span the range 0.4 < r < 1.5, as we want the radius 
of the last search cylinder to be eq ual or larger than sm all groups 
typical size (~ 0.5 A _1 Mpc, see iBorgani etallll997l and refer- 
enc es therein) and smal ler than an Abell radius (~ 1.5 /r'Mpc, 
see[Borgani et al.l l 1997b . 3) We let X/, -Cn and I vary from 4 to 
20 /z^'Mpc, to include clusters with velocity dispersion as high 
as 2000 km/s also at high redshift (z ~ 1). In this case the lower 
limit is mainly suggested by our redshift measurement error, that 
has to be added to peculiar velocities. We imposed to Hm and 
Hui the same limits as r and /. Nevertheless, we also checked the 
performances of the algorithm when no limits are applied to 'Rm 
and X///, and we verified that, with the exception of very few 
cases, Hm and X.m 'behave well', as we expected as the whole 
algorithm is based on physical scales and scaling laws. 

Exploring the 6D parameter space, we found the parame- 
ter set that kept C\ and Ci as high as possible and Pi and Pi 
at least above 50%, while monitoring also the behavior of the 
group properties, both on a group by group basis and on a statis- 
tical point of view. Then we moved slightly around these chosen 
values with smaller steps, to search for a possible finer tuning. 

At the end of this finer search, we found the following pa- 
rameter set, from now on called the best set of parameters: 

- Hi = 0.28 fc _1 Mpc 

- £/ = 7.0 fc _1 Mpc 
-Ru = 0.6 A _1 Mpc 
-Ln = 5.0/r'Mpc 

- r = 0.55 h^Mpc 
- 1 = 14.0 A _1 Mpc 

We assigned to each group a redshift and a position in the 
R.A.-Dec. plane, respectively defined as the median values of 
redshift, Right Ascension and Declination of the group mem- 
bers. 

Values of the quality parameters C\, Ci, P\, Pi, S ga i and 
// can be found in Table [T] Note that, to test the quality of the 
algorithm also as a function of redshift, we considered separately 
two redshift bins (0.2 < z < 0.6 and 0.6 < z < 1 .0). 

We analyzed completeness and purity also as a function of 
group richness. Figure|5]shows C\ and Ci as a function of "fidu- 
cial" group members and P\ and P2 as a function of "recon- 
structed" group members. One-way statistics are shown as blue 
squares, and two-ways statistics are shown as red crosses. C and 
P have been computed separately in each mock. In Figure [5] we 
plot C and P values averaged over all mocks, while error bars 
are their rms. The differences between C\ and Ci and between 
Pi and P2 indicate that our group catalogue will not be com- 
pletely free neither from overmerging nor from fragmentation. 
Moreover, Figure [6] shows that, while the galaxy success rate 
S g a i does not vary much as a function of N, the interloper frac- 
tion fi decreases by a factor of ~ 2 from N >2to N >9. 



5.3. Tests on recovered group properties 

The n(z) distribution. We analyzed how well the "fiducial" group 
distribution n /,y(z) as a function of redshift is recovered by the 
distribution rtvDM(z) of the groups found by the algorithm. We 
averaged the n /,-</(z) distribution over 20 independent V VDS-like 
mocks to obtain its mean value, plotted as a continuous line in 



Table 1. Quality statistics (C\, C2, Pi, Pi, S ga i and fi, see text 
for details) of the reconstructed group catalogue, for two differ- 
ent redshift bins and for the whole redshift range. The first table 
shows Ci, C2, Pi, Pi, S gai and fi for all groups, while the sec- 
ond table for groups with at least 3 members. Each parameter is 
computed as the mean over the 20 VVDS-like mocks, and the 
associated error is its rms. 

Quality statistics for N>2 



Quality parameter 


0.2 < 


z < 0.6 


0.6 < 


z< 1.0 


0.2 <. 


i< 1.0 


Ci 


0.68 


±0.03 


0.57 


±0.04 


0.63 = 


= 0.03 


c 2 


0.56 


±0.04 


0.49 


±0.05 


0.53 = 


= 0.03 


Pi 


0.56 


±0.02 


0.55 


±0.04 


0.56 = 


= 0.02 


Pi 


0.48 


±0.04 


0.50 


±0.04 


0.49 = 


= 0.03 


S gal 
fl 


0.72 


±0.03 


0.59 


±0.03 


0.67 = 


= 0.02 


0.38 


±0.02 


0.43 


±0.04 


0.40 = 


= 0.02 


Quality statistics for N>3 


Quality parameter 


0.2 < 


z<0.6 


0.6 < 


z < 1.0 


0.2 < , 


:< 1.0 


Ci 


0.73 


±0.06 


0.57 


±0.05 


0.67: 


= 0.04 


c 2 


0.65 


±0.06 


0.52 


±0.06 


0.60 = 


= 0.03 


Pi 


0.61 


±0.03 


0.58 


±0.05 


0.60 = 


= 0.03 


P2 


0.50 


±0.05 


0.51 


±0.07 


0.50 = 


= 0.05 


S gal 


0.75 


±0.04 


0.60 


±0.04 


0.70: 


= 0.03 




0.35 


±0.03 


0.40 


±0.05 


0.37: 


= 0.02 
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Fig. 6. Interlopers fraction fi (empty symbols) and galaxy suc- 
cess rate S ga i (full symbols) for different lower limits of group 
richness (x axis). Diamonds are for the redshift bin 0.2 < z < 
0.6, triangles for 0.6 < z < 1.0 and squares for the entire range 
0.2 < z < 1 .0. S ga i and fi have been computed separately in each 
mock. In this plot, points are S ga i and // values averaged over all 
mocks, while error bars are their rms. 



Figure|7] In the Figure, the mean Hvdm{z) for the same 20 inde- 
pendent mocks is shown as black points, with vertical bars being 
the rms among the 20 mocks. The plot shows that the difference 
between «vdm(z) and n/,d(z), despite the presence of fake and/or 
missing groups in the VDM catalogue, is within the errors. A 
X 1 test between the two mean distributions gives x 2 = 1 -4. We 
therefore conclude that the two n(z) distributions are statistically 
consistent with each other, even if there is a tendency for having 
more VDM reconstructed groups at low redshift. We repeated 
the same test using only groups with at least 5 members and with 
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1 10 1 10 

N(fid) N(VDM) 

Fig. 5. C\ and C2 statistics as a function of "fiducial" group members (on the left) and Pi and P2 statistics as a function of "re- 
constructed" group members (on the right). One-way statistics are shown as blue squares, and two-way statistics are shown as red 
crosses. C and P have been computed separately in each mock: in these plots points are C and P values averaged over all mocks, 
while error bars are their rms. On the x axis we grouped the number of elements in the following way: [N=2],[N=3,4] ,[N=5,6] and 
|N>7] 



cr > 350 km/s, that is those groups for which we are sure we can 
compute a reliable velocity dispersion, and we found also in this 
case that «vdm(z) and «/id(z) are consistent with each other. 

Velocity dispersion. As discussed above, the comparison be- 
tween group properties in the fiducial and in the reconstructed 
catalogue is an important check to verify that VDM algorithm 
is not only able to recover real groups, but also to preserve their 
characteristics. This means that when we compare the two cat- 
alogues on a group-by-group basis, the fractions of interlop- 
ers and missing galaxies modify group properties only below 
some tolerance level. The same has to hold also for the fidu- 
cial and reconstructed statistical distributions of these properties, 
when considering that the reconstructed catalogue contains fake 
groups and it misses some groups. 

For each VDM group, we measured the velocity disper- 
sion cr VDM of its galaxies using Equation [1] correcting it as in 
Equation |3 Figure [8] shows the comparison between the veloc- 
ity dispersion in the reconstructed groups {cr VDM ) and the virial 
velocity dispersion (cr V! > quoted in the simulations) of the fidu- 
cial groups in VVDS-like mocks on a group-by-group basis. 
Only two-ways matches are considered. The Figure is divided 
in two panels as Figure |4] the upper part shows the scatter plot, 
the lower shows the percentage error, computed as in Figure |4] 
Green and blue triangles are groups with at least 5 members, or- 
ange and purple squares groups with at least 8 members; green 
and orange points are single groups, while blue and purple sym- 
bols are the median (on x axis) and mean (on y axis) values in 
bins of the property on the x axis. Vertical error bars are rms of 
mean values. 

This scatter plot shows the following: on a group-by-group 
basis, for o~vdm ^ 350 km/s, close to the intrinsic limit set by 
the flux-limited nature of the VVDS catalogue, the correlation 
between o~ yom and cr vir is such that cr VDM overestimates cr V! >, but 
on average always by < 30% for groups with at least 5 members, 
while this overestimate is on average < 10% for groups with at 
least 8 members. In fact, we have shown in Section l3~2l that the 
velocity dispersion cr meas that one can measure in groups within 



a VVDS-like data sample is not a reliable estimator of o" v / r for 
cr meas < 350 km/s. 

Besides the group-by-group comparison, it is also interest- 
ing the analysis of the velocity dispersion distributions, thus in- 
cluding unrecovered and fake groups in the fiducial and in the 
reconstructed catalogues respectively. Figure [9] shows the com- 
parison between n(cr 1 ,, > ) and n(cry DM ) distributions (the solid line 
and the black diamonds respectively). Values on the y axis are 
averaged over 20 VVDS-like mocks. Vertical bars associated to 
o~vdm points are their rms over the 20 mocks. One can notice 
that the area below the two distributions is different. This mainly 
because in o~ vdm distribution we excluded groups for which we 
were not able to measure cr, i.e. groups for which we imposed 
o~vdm = 0. This comparison shows that the two distribution 
agree for cr > 350 km/s, as confirmed through a^ 2 test between 
the two mean distributions for cr > 350 km/s. 

As a further test for the recovered o~vdm distribution, we 
compared it with the n(cr,,„.) in mock catalogues with the same 
flux limits as VVDS-02h sample but with 100% sampling rate 
(the M(100, 0) catalogues presented in Subsection l2.2b . and with 
the n(cr,,„.) of mock catalogues with no flux limits (the com- 
plete light cones from which M( 100,0) catalogues have been 
extracted). In Figure[l0] we show the normalized mean n(cr VDM ) 
(black diamonds) for cr > 350 km/s. It is the same distribution as 
in Figure [9] but it is normalized by the total numbers of groups 
with cr > 350 km/s. Overplotted green triangles represent the 
normalized mean n(cr vir ) of fiducial groups in M{ 100,0) mock 
catalogues, and the orange crosses are the normalized mean 
«(cr VI >) distribution of fiducial groups in complete light cones of 
the MILLENNIUM Simulation. For each distribution, the red- 
shift range considered is 0.2 < z < 1 .0. Considering these nor- 
malized distributions for cr > 350 km/s, a test between the 
k(o~vdm) and the n(cr vir ) for M(100, 0) catalogues leaves us with 
X 2 such that we can conclude that the two distributions are sta- 
tistically in agreement. We obtain the same result when we ap- 
ply the same test between n{cr VDM ) and n(cr vir ) for the complete 
catalogues. This means that the m(o> dm ) of the groups recon- 
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Fig. 7. Mean distribution of fiducial groups as a function of red- 
shift (continuous line), computed as the average over 20 VVDS- 
like mocks. The mean distribution of VDM-reconstructed 
groups over the same 20 mocks is over-plotted as black points. 



structed by our algorithm is unbiased with respect to the n(cr V! >) 
of groups in the complete light cones. 

We repeated the tests shown in Figures l9l and [TOl also using 
only groups with at least 5 members, and we found similar re- 
sults. 

As discussed in Section |4] one of the primary goals of the 
VDM is to be able to recover the virial line of sight velocity dis- 
persion of group galaxies, at least above some minimum thresh- 
old. This is not achieved, for example, by other commonly used 
group-finding algorithms, such as the FOF method (see Section 
@). The comparisons of «(<x) distributions between reconstructed 
and fiducial groups presented in this Section show that this aim 
has been successfully obtained in a deep redshift survey such as 
VVDS, at least up to z — 1. Moreover, VVDS redshift measure- 
ment error and sampling rate imposed an a priori lower limit for 
a reliable measurement of the line of sight velocity dispersion 
of group galaxies (cr > 350 km/s, see Subsection l3.2l ). We have 
shown in Figures l9l and [TOl that the finding group algorithm we 
used not only can recover a reliable n{cr) distribution above some 
minimum cr, but also it does not worsen the minimum cr thresh- 
old imposed by the survey strategy itself. This result has been 
reached thanks to the flexibility of the 6 VDM parameters. Each 
of them has a specific role in determining the choice of the group 
members, through an intuitive localization of group barycenters 
(Phase I), a reliable estimate of the central richness (Phase II) 
and a correct exploitation of group scaling laws (Phase III). 

Sampling rate. As we applied the algorithm to the VVDS- 
like mocks, we optimized it for the whole observed area (~ 0.5 
deg 2 each), irrespective of the varying sampling rate across the 
field. Nevertheless, we tested also how completeness and purity 
change if computed separately in areas with very different sam- 
pling rate, that is covered by 1, 2 or 4 passes of the spectro- 
graph (hereafter called 'lp', '2p' and '4p' areas). For this test, 
we assigned each group to the lp, 2p or 4p area according to 
its R.A-Dec. position (computed as the median value of R.A. and 
Dec. of the member galaxies), even if it extends over an area 
with a sudden drop/increase of the sampling rate. Considering 
the whole redshift range 0.2 < z < 1 .0, in the 4p area we find 
d = 0.67 + 0.04, C 2 = 0.54 ± 0.04, P x = 0.55 + 0.03 and 
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: 0.20<z<1.00 
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Fig. 8. Comparison between virial (y axis) and VDM recon- 
structed (x axis) group velocity dispersion. Only 2 ways matches 
are considered in this plot. The upper panel shows the scatter 
plot, the lower shows the percentage error. Green and blue tri- 
angles are groups with at least 5 members, orange and purple 
squares groups with at least 8 members; green and orange points 
are single groups, while blue and purple symbols are the median 
(on x axis) and mean (on y axis) values in bins of the property 
on the x axis. Vertical error bars are rms of mean values. 
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l.o.s. velocity dispersion 

Fig. 9. Mean distribution of virial line of sight velocity disper- 
sion (continuous line), computed as the average over 20 VVDS- 
like mocks. The mean distribution of cr of groups reconstructed 
by the VDM, averaged over the same 20 mocks, is over-plotted 
as black points, with vertical bars corresponding to its rms. 

P 2 = 0.48 ± 0.08, while in the (l+2)p areas Ci = 0.58 ± 0.04, 
C 2 = 0.52 ± 0.04, P x = 0.57 ± 0.03 and P 2 = 0.49 ± 0.04. 

While C 2 , Pi and P 2 differences are inside error bars, one can 
notice a larger worsening in Ci when we decrease the number of 
spectrograph passes, i.e. the sampling rate. Moreover, analyzing 
the dependence of C and P on group richness, we can add that 
in the 4p area completeness is higher even for N > 5. Moreover, 
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Table 2. Quality statistics (C\, C2, Pi, Pi, S ga i and fi, see text 
for details) of the group catalogue reconstructed by the algorithm 
with the high-purity parameter set, for two different redshift bins 
and for the whole redshift range, considering groups with N>2. 
Each parameter is computed as the mean over the 20 VVDS-like 
mocks, and the associated error is its rms. The same parameters 
for groups with N>3 are consistent, within error bars, with those 
presented here. 

Quality statistics for N>2 



500 600 700 800 
I.o.s. velocity dispersion 



Quality parameter 


0.2 <. 


:<0.6 


0.6 < . 


1 < 1.0 


0.2 < , 


:< 1.0 


C, 


0.29 = 


t0.03 


0.20 = 


t0.03 


0.24- 


= 0.02 


c 2 


0.24 = 


t0.03 


0.17 = 


t0.02 


0.20 = 


= 0.02 


Pi 


0.75 = 


t0.04 


0.73 = 


t0.06 


0.74 = 


= 0.04 


Pi 


0.66 = 


t0.07 


0.69 = 


t0.06 


0.67 = 


= 0.05 


S gal 

fi 


0.32 = 


t0.03 


0.21 = 


tO.04 


0.28: 


= 0.03 


0.24 = 


t0.04 


0.27 = 


= 0.05 


0.25 = 


= 0.03 



Fig. 10. Normalized mean distribution of cr VDM (black dia- 
monds) for cr > 350 km/s. It is the same distribution as in 
Figure|9] but it is normalized by the total number of groups with 
cr > 350 km/s. Overplotted green triangles represent the normal- 
ized mean cr vir distribution of fiducial groups in MILLENNIUM 
mock catalogue with flux limits at Iab = 24 and with 100% sam- 
pling rate (M(100, 0) mock catalogues, see Subsection l2.2l i. and 
the orange crosses are the normalized mean cr Wj . distribution of 
fiducial groups in complete light cones of the MILLENNIUM 
Simulation, i.e. catalogues with no flux limits. For each distribu- 
tion, the redshift range considered is 0.2 < z < 1.0. 



we also notice that in 4p area there is a higher overmerging, es- 
pecially for N < 4, while in the (l+2)p area fragmentation is 
increased for N > 5. 



5.4. High purity parameters 

With the best set of parameters, we can obtain from VVDS- 
02h data a group catalogue with high completeness, but it has 
been shown that only ~ 50% of groups is pure. This means that 
each group identified by the algorithm has, on average, only 50% 
probability of being a real group. It could be useful to identify 
the subsample of groups that has an even higher probability of 
being real. Thus, we optimized the group-finding algorithm a 
second time, in this case maximizing purity (but paying atten- 
tion not to reduce the new recovered group catalogue to a few 
'super-secure' groups). The so called high-purity parameter set 
is the following: 

-% = O.lO/r'Mpc 

- Li = 5.0 /r'Mpc 

- ft,, = 0.6 /i-'Mpc 

- tn = 5.0 ^Mpc 

- r = 0.55 /i-'Mpc 
- 1 = 14.0 h~ l Mpc 

Table [2] shows C and P for the high-purity parameter set. 
Necessarily, C is very low, but now each group identified by the 
algorithm has ~ 70% of probability of being real, and the inter- 
lopers fraction // decreases from ~ 40% to ~ 25% with respect 
to the one obtained with the best set of parameters (see Table[TJ). 



6. VVDS-02h field group catalogue 

We applied the group-finding algorithm to the VVDS-02h sam- 
ple described in Section 12.11 using the best set of parameters. 
We defined the redshift and the position in the R.A.-Dec. plane of 
each group as the median values of redshift, Right Ascension and 
Declination of the group members. Figure QT| shows the redshift 
distribution of the identified groups, with different line styles for 
different cuts in group richness, as indicated in the Figure. It is 
clear that beyond z ~ 1 there is a significant drop in the number 
of recovered groups, irrespectively of their richness, as expected 
from Figure [2] This drop in the redshift distribution may partly 
be related also to the choice of optimizing the algorithm only up 
to z — 1 (see Section 15721 . We applied the VDM to our galaxy 
sample also using the high-purity set of parameters. With the 
best set of parameters, the algorithm identified 318 groups with 
2 or more members in the redshift range 0.2 < z < 1.0, one third 
of them having also been detected with the high-purity set. The 
identified groups comprise ~ 19% of the galaxies in our sample. 
Comparing this percentage with the fraction of galaxies that re- 
side in groups in VVDS-like mock catalogues, we found that it 
is consistent with both the fraction of galaxies residing in fidu- 
cial groups (~ 20 %) and the percentage of galaxies residing in 
reconstructed groups (~ 22%). 

For each group we estimated the line of sight velocity dis- 
persion cr. We used the gapper method, as described in Section 
13.21 and we corrected it for the redshift measurement error sub- 
tracting it in quadrature as in Equation [3] We set cr = km/s 
for those groups with a measured cr G (from Equation Q} lower 
than the redshift error. ~ 25% of groups with cr > 350 km/s 
have been detected by the algorithm also with the high-purity 
parameter set. 

It is worth noticing that, given the small value of the param- 
eter r, driving the projected dimension of the search cylinder in 
Phase III (see Section [4~3b , the typical projected radius within 
which the full set of group members is selected is always < 1 
/j _1 Mpc. 

Detailed group catalogue statistics are shown in TablefJ] The 
number of groups that has been found in VVDS-02h field is 
quoted. Different rows are for different values of velocity dis- 
persion, different columns for different richnesses. Numbers in 
brackets indicate the number of groups that have been identi- 
fied by the algorithm also with the high-purity set of parameters 
(even if with less members). 

We tested the reliability of the reconstructed catalogue by re- 
computing the groups excluding galaxies with flag=2 and 9, i.e. 
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Fig. 11. The redshift distribution of groups in the VVDS-02h 
sample, found using the best set of parameters. Different line 
styles are for different cuts in group richness, as indicated. The 
total number of groups with the corresponding richness is quoted 
in the labels, for two different redshift ranges. 



Table 4. List of groups recovered in the VVDS-02h field in the 
range 0.2 < z < 1.0. Columns are the following: 1) group ID; 
2) R.A.; 3) declination; 4) redshift; 5) number of detected mem- 
bers; 6) l.o. s. velocity dispersion cr; 7) possible high purity. R.A. 
and declination are in degrees and cr in km/s. R.A., declination 
and redshift are the median values of all the galaxies in each 
group. The star near the group ID label those groups found by 
the algorithm also when using only galaxies with flag 3 and 4 
(see text for further details). Column 7 labels with an 'H' those 
groups detected by the algorithm also with the high-purity set 
of parameters. See the electronic edition for the complete list of 
VVDS-02h groups. 



grID 


R.A. 


Dec. 


z 


N 


cr 


Purity 




deg. 


deg. 






km/s 




124* 


36.56310 


-4.31748 


0.5850 


3 


351 


H 


144* 


36.79910 


-4.59669 


0.6135 


5 


428 


H 


224* 


36.80323 


-4.67940 


0.7898 


4 


392 





Table 5. List of group galaxies belonging to the groups listed 
in Table |4] Columns are the following: 1) galaxy ID; 2) R.A.; 
3) declination; 4) redshift; 5) redshift quality flag (see Section 
12.11 ); 6) ID of the group to which the galaxy belongs 7) total 
number of group members. R.A. and declination are in degrees. 
See the electronic edition for the complete list of VVDS-02h 
group galaxies. 



using only galaxies whose redshift has a high likelihood (> 95%) 
of being correct. We verified that, with respect to our original 
group catalogue, 80% (/77%/75%) of the groups with at least 5 
(/4/3) members are still recovered. These means that for these 
recovered groups the galaxies with flag=2 and 9 were not in the 
seed of the group, i.e. in the first set of galaxies recovered in the 
Phase I of the algorithm (see Section l4~TT i. 

Table |4] lists all the groups identified in the redshift win- 
dow 0.2 < z < 1.0. It is worth noticing that the quoted num- 
ber of members has to be considered as a lower limit for the 
real richness, as the sampling rate of our survey is not 100%. 
The groups labeled with a star near their ID are those recov- 
ered also when using only galaxies with flag=3 and 4. We apply 
this label only to groups with at least 3 members. The group 
members are presented in Table Note that the galaxy ID is 
the same used to identify galaxies in the public VVDS release at 
http://cencosw.oamp.fr 

In Figure[12] the two-dimensional VVDS galaxy distribution 
is shown, with galaxy positions projected on Right Ascension 
and redshift. Each plot shows a different redshift bin, as quoted 
on the y axis. Black dots are field galaxies, while coloured dots 
are group members (blue dots are pair members, green are triplet 
members, orange are quartet members and red dots are galaxies 
included in groups with 5 or more members). 

6. 1 . Line of sight velocity dispersion of group galaxies 

It is interesting to check whether the real universe looks like the 
simulated one. In this section we compare the VVDS catalogue 
with the Millennium-based mock catalogues. 

We compared the n(cr) distributions of real and simulated 
groups. Figure [13] shows the n(cr) distribution for all VVDS- 
02h groups in the redshift range 0.2 < z < 1 .0 (red triangles) 
and the n(cr) distribution for VVDS-like mock catalogues. As in 
Figure|9] the continuous line is the distribution of cr v! > of fiducial 
groups, while black points represent the mean distribution for 
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°) The galaxy ID refers to the public VVDS release at 
http://cencosw.oamp.fr 



reconstructed groups, with vertical bars being the rms of the 20 
mock catalogues. In this plot we consider the cr measured with 
the gapper method (for both mocks and real data) and not the 
virial velocity dispersion. Moreover, we are excluding groups 
with measured cr equal to 0, because this value indicates that 
we have not been able to measure it due to the redshift mea- 
surement error (see Equation[3]l. This is the reason why the area 
under n(<x vlr ) in the plot is larger than the area under the other 
distributions. In this Figure one can notice a consistency in the 
«(cr) distributions of real and mock group catalogues, at least for 
cr > 350 km/s. 

The relatively large number of groups for which the velocity 
dispersion estimated through Equation [3] is formally negative is 
probably due to the fact that we did not take into account possi- 
ble dependences of the mean redshift error on the properties of 
the galaxies (i.e. magnitude, presence of emission lines etc.). It is 
likely that for many of these groups the redshift error associated 
to their galaxy members is somewhat smaller than the adopted 
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Table 3. Number of VVDS-02h groups reconstructed by the algorithm using the best set of parameters in VVDS-02h field, for 
0.2 < z < 1.0. Statistics are quoted as a function of the number of group members (columns) and of measured line of sight 
velocity dispersion of group galaxies (cr, in km/s, rows). Numbers in brackets indicate the number of groups found also applying 
the algorithm with the high-purity set of parameters. 
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") We refer the reader to Section|6jfor the meaning of cr = 



average value (275 km/s). Nevertheless, we are reassured by the 
fact that none of the groups with N > 5 has cr — 0. 



6.2. Comparison with other group catalogues in the same 
field 

Several group catalogues have already been compiled from 
different kind of observations and with different methods in 
the sky area covered by the VVDS-02h field. For exam- 
ple, X-ray clusters have been identified from XMM-Newton 



images and then spectroscopically confirmed (Andreo netal 



2004b; Valtchanov et al. 2004; Andreon et al. 2005; Willis et al 
2005 albt iBremer et alj|2006t IPierre et alj|2006l). The m atched- 
filter technique has been used as well (lOl sen et al 



20071) 



together with a weak lensing search (G avazzi & Soucail 
120071) and structure identification through photometric redshift 
dMazure et al.ll20O7h . All these latter methods were applied to 
photometric data from CFHTLS. 

Among the X-ray clusters of the XMM-LSS, only 8 clusters 
fall in the VVDS-02h field area in the redshift bin 0.2 < z < 1.0: 
XLSSC 005, XLSSC 013 and XLSSC 025 from CI catalogue, 
XLSSC 038 form C2 catalogue an d then the clusters a, b, c, d 
from C3 catalogue (see Table 3 in IPierre et alj [2006). We find 
that both clusters b and c have a counterpart in our VDM cat- 
alogue (with 6 and 8 detected members respectively), with an 
almost perfect match in their barycenters. Clusters XLSSC 013 
and XLSSC 025 have possible counterparts at the same z (with 
4 and 3 members respectively), but their barycenters in ra-dec 
have a shift of ~ 200 h^ 1 kpc. Inspecting these two groups more 
in details, we find that the possible XLSSC 025 counterpart is 
dominated by a massive galaxy that is distant from XLSSC 025 
barycenter ~ 100 hr x kpc, showing that in this case a better 
match would have been obtained if we had computed a mass- 
weighted barycenter. On the contrary, for XLSSC 013 counter- 
part we do not identify any dominant galaxy. This shift of ~ 200 
h~ l kpc could also be explained by the following: we studied the 
distances between the barycenters of VDM groups and their cor- 
responding fiducial groups, and we found that their distribution 
is a Gaussian centered at ~ with a scatter of ~ 200 hr x kpc. 
Finally, we do not find counterparts for XLSSC 005, XLSSC 
038, a and d in our catalogue. They fall inside our low sampling 
rate areas (covered only by 1 or 2 passes of the spectrograph), 
and a further inspection confirmed that the sampling rate in those 
regions does not allow our algorithm to find at least two galaxies 
inside the volume enclosed in the Phase I cylinder. 

We concluded this comparison with XMM-LSS detections 
inspecting the relation between optical and X-ray properties of 
the four groups for which there exists a (possible) XMM coun- 
terpart. In particular, we considered the relation between the X- 



ray luminosity Lx presented in Table 5 of Pi erre et all d2006l) and 
the velocity dispersions cr we have measured. We verified that 
groups XLSSC 013, b and c have a cr-Lx relation well in agree- 
m ent with the linear fit in the plane cr-Lx presented in Figure 13 
of Popess o~et alJ (120051) . For group XLSSC 025 we measure a cr 
that would be too low for its quoted Lx, according to the shown 
relation, but as its cr is of the order of 200 km/s it does not reside 
in the cr range that we consider reliably measured. 

It is worth noticing that our richest groups (10 groups with at 
least 7 members) do not match with XMM-LSS clusters, except 
one that is the counterpart of the X-ray selected group c. There 
are at least three reasons why an optical group may have not 
been detected in X-ray: a) it may falls on the boundaries of a 
XMM-LSS pointing, thus in a region where the X-ray detector 
is affected by vignetting; b) it may have a redshift much higher 
than the mean z reachable by the performed X-ray observations; 
c) it may have a low surface brightness, that corresponds to a 
shallow potential well of the mass distribution, thus making X- 
ray detection more difficult. We inspected our richest groups, 
and we found that all of them fall in at least one of these three 
categories. In particular, we verified that the N(z) distribution of 
all X-ray clusters in the above-cited works is peaked at z ~ 0.4, 
while the N{z) distribution of our richest groups is quite flat and 
reaches z ~ 0.9, with 5 groups with z > 0.7. Moreover, at least 
half of our richest groups do not have a dominant member, that 
is a galaxy with luminosity and/or stellar mass much higher than 
the others. The VVDS-02h field sampling rate could be enough 
to explain this lacking of dominant galaxies, but in principle we 
can not reject the hypothesis that the dominant galaxy in (some 
of) these groups may not exist, and in this latter case we are 
allowed to think that these groups have a real low X-ray surface 
brightness. 

We compared our gr o up catalogue also wit h the ones in 
lGavazzi & Soucaill d2007l) , lOlsen et all d2007l) and lMazure et alJ 
d2007h . 

Among the about 20 clusters in lOlsen et a l] d2007h that are 
inside the sky area and redshift range that we have explored, 
roughly half fall inside regions with too low sampling rate for 
our Phase I cylinder to be able to detect at least a pair; two of 
them (ID 30 and 42) fall very near in redshift to two wide struc- 
ture at z ~ 0.32 and z ~ 0.45 , within which our algorithm detects 
(possibly fragmenting them) a few groups. Finally, considering 
the depth of the redshift bins in which Olsen's groups can reside 
(Az ~ 0.1) due to the use of photometric redshifts, we find that 
for 5 groups in Olsen's catalogue there exists a counterpart in 
our catalogue. 

Am ong the about 30 structures detected by iMazure et alJ 
(2007) in the redshift range 0.2 < z < 1.0, we find that about 
20 fall inside regions with too low sampling rate for our finding 
group algorithm (13 of which in the 1 pass area); a few of them 
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Fig. 13. As in Figure [9] but in this case the velocity dispersion 
distribution of VVDS-02h field groups is also shown with red 
triangles. 
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pling rate areas, thus in regions where our algorithm did not de- 
tect any group. 

In this comparison we also took into account that in the 
CFHTLS data that have been used in the three above-mentioned 
works there are masked sky regions that have not been used 
for group finding, as it is shown for example in F igure 1 in 
iMazure et all (|2007) and Figure 9 in lOlsen et ail (120071) . We find 
that ~ 5% of our groups in the range 0.2 < z < 1.0 fall in 
those regions. Moreover, we observe that roughly half of this 
masked area falls inside the region that in VVDS-02h field has 
the highest sampling rate (the central area highlighted in Figure 
Q}, and that this higher-sampling region covers only ~ 25% of 
the VVDS-02h field. Thus, the percentage of our groups falling 
in the masked areas increases to ~ 8% for groups with at least 
3 members and to 20% for our 10 richest groups (those with at 
least 7 members). 



0.600 



Fig. 12. Two-dimensional VVDS galaxy distribution as a func- 
tion of Right Ascension and redshift (points are compressed on 
the Declination dimension). Each plot shows a different redshift 
bin (0.2 < z < 0.6 and 0.6 < z < 1 .0 in upper and lower panel re- 
spectively). Black dots are field galaxies, coloured dots are group 
members, according to the group catalogue obtained with the 
best set of parameters. The colour code is the following: blue 
dots are pair members, green are triplet members, orange are 
quartet members and red dots are members of groups with 5 or 
more members. 



reside in redshift slices (z ~ 0.3, z ~ 0.7 and z ~ 0.9) where a 
wide (in ra-dec) structure is also present, that has been possibly 
fragmented by our algorithm, producing in our catalogue more 
than one counterpart . Finally, three of the structures detected by 
Mazu re et a D (l2007h have a possible direct counte rpart in our 
catalo gue (general ID 5, 19 and 21, see Table 3 in lMazure et alJ 

l2007h . " 

Fin ally, the 3 structures detected by iGavazzi & SoucaiH 
(2007) and that fall inside VVDS-02h field are in very low sam- 



7. The U-B colour of group galaxies 

Having reconstructed a catalogue of groups at high z, we want 
now to use it to study the dependence of galaxy properties on en- 
vironment, and also its evolution with cosmic time. More specif- 
ically, we aim at investigating if physical properties of group 
galaxies are different from the properties of the entire sample of 
galaxies, up to z ~ 1 . Are the relations that we see in groups at 
low redshift already present at z ~ 1 ? Is there any unambiguous 
signature of time evolution in known scaling relations character- 
izing galaxies in cluster environments? In this paper, we will not 
carry on an exhaustive analysis of this topic, that will be possibly 
the goal of a future work. The main aim of this paper is to present 
the VVDS-02h field group catalogue and discuss its reliability. 
So, in this Section we simply want to show the potentiality of 
our group catalogue for studies related to environmental effects 
on galaxy properties on group scales. 

As we want to investigate the redshift evolution of the prop- 
erties of group galaxies, we have to study a group sample homo- 
geneous at all z- We thus require that the groups we use for this 
analysis have at least two members brighter than a luminosity 
limit that allows us to be complete up to z = 1 . This luminos- 
ity limit evolves with redshift. Following roughly the evolution 
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of M* , the charac teristic magnitude of the luminosity function 
(lllbert et al.ll2005l) . we set this limit as Mb < —18.9 — Liz. Our 
'group galaxy' sample is composed by those galaxies, brighter 
than this limit, in groups with at least two members brighter than 
this limit itself. Moreover, we define a 'total' galaxy sample con- 
sidering all galaxies brighter than this limit (including also those 
in groups). 

Once defined the sample, as a first step we studied the frac- 
tion of 'blue' galaxies (/;, from now on) in both the group and 
total samples, in the range 0.2 < z < 1.0. The general blueing 
of cluster galaxy population for i ncreasing redshift, first sh own 
bv lButcher & Oemlerl (Il978l) and lButcher & Oemlerl (11984 and 
known as the Butcher-Oemler effect, ha s been widely confirmed 



in following studies ( s ee for example Margoniner et al.l [200 1 ; 
iDe Propris etaT] 120031 IGerke et all 120071) . Nevertheless, nowa- 
days there is no full agreement about the origi n of this blue- 
ing. I t can be related to environmental effects (e.g. lDressler et al] 
1997), but it has also been suggested that it is consistent with the 
overall ageing of all galaxie s, irrespective of their environment 
dAndreon et al.ll2004 j.[2006h . 

According to our criteria, a galaxy is 'blue' if it has a colour 
U-B < 1. This threshold has been chosen as it corresponds 
roughly to the minimum (i.e. the green valley) in the bimodal 
U-B colour distribution. Moreover, this colour cut has been 
kept constant at all redshifts as we found that the green valley 
colour does not evolve much in the z range considered. For the 
computation of the U- and fl- band absolute magnitudes we refer 
the reader to lllbert et all d2005l) . 

Since our goal is to study ft, as a function of redshift, we first 
verified that the failure rate in redshift measurement does not 
depend on redshift for specific galaxy colours. We assigned to 
each gala xy a "photometric type" according to the scheme pro- 
posed by Zucca et al. (2006). The classification is carried out by 
fitting the Spectral Energy Distribution of galaxie s to six tem- 
plates (four observed spectra, Coleman et al J 19801 and two star- 
bu rts SEDs.lBruzual A. & Charlotlll993h . We then proceeded as 
in iFranzetti et al.l d2007l) . bv defining a broad bimodal classifi- 
cation. We considered E/S0 and early spirals as 'early type', 
and late-type spirals, irregular and starburst types as a Tate- 
type'. The relation between this classification scheme and the 
colour U-B that we use to compute ft, is monotonic, with bluer 
colours being associated to Tate type' templates. In particular, 
our 'early type' population constitutes > 90% of the galaxies 
with U-B > 1. We computed the Tate type' galaxy fraction 
in both our spectroscopic sample and in the photometric parent 
catalogue, in three redshift bins in the range 0.1 < z < 1.0 (us- 
ing photometric redshifts for the parent catalog ue, see Section 
12. ll for their determination). As already found bv lFranzetti et alJ 
(2007), who carried on a similar analysis on wider redshift in- 
tervals up to z ~ 2, the Tate type' fraction is 3% higher in the 
spectroscopic sample and this increment does not depend on red- 
shift. This result implies that any trend of ft, with redshift is not 
due to a measurement bias in our sample. 

Figure [14] shows ft, for the group galaxies (blue triangles) in 
three different redshift bins: 0.2 < z < 0.5, 0.5 < z < 0.7 and 
0.7 < z < 1.0. The vertical error bars are the lcr confidence 
levels associated to ft,, computed with the usual approx i matio n 
of the formula for binomial statistics given in iGehre ls (1986): 
o 2 = fbfr/n, where f r = 1 — fi, and n is the total number of 
galaxies in the redshift bin. 

As reference, we plot the linear fit of the three points as a 
blue line, while the upper black line is the linear fit for ft, com- 
puted within the 'total' sample, ft, is clearly lower in groups than 
in the total sample. The slopes of the two fits, together with their 



lcr confidence levels, are 0.27 + 0.07 and 0.15 + 0.02 for the 
group and total sample respectively. Although they are both sig- 
nificantly different from zero, and the group slope is steeper, they 
are compatible with each other, the group sample slope being 
steeper only at a 1.6<x significance level. At this stage of investi- 
gation we are only able to confirm the different overall value of 
f, between group galaxies and the total sample, but not their pos- 
sible different evolution. We verified that these results are stable 
against the variation of the U-B threshold adopted to define blue 
galaxies (by ±0.05 mag). They did not change significantly also 
when we implemented a colour cut that depends on luminosity, 
following the mild dependence on magnitudes of the green val- 
ley locus. Although we do not detect any redshift dependence of 
the green valley locus up to z — 1, we also allowed the colour cut 
to vary by 0. 1 mag redward for a ny redshift decrease of dz — 1 .0 
(as su ggested by Blanton 2006 and as adopted by IGerke etaD 
120071) . As a matter of fact, even in this case the two slopes are 
appreciably different from zero, and their relative difference sig- 
nificant at the ~ 1 .5cr level. 

We compared our results with those presented bv lGerke et al.l 
d2007l) . who studied the fraction of blue galaxies in groups and in 
the field within different subsamples extracted from the DEEP2 
data set. In their Sample I, the one with a selection most similar 
to ours, they find that ft, is lower in groups than in the field, but 
they do not detect any significant evolution of ft, with z neither 
in groups nor in the field. Anyway, evolutionary effect are much 
more difficult to quantify in that sample since the redshift range 
as well as the luminosity range covered is narrower with respect 
to that explored in this study. 

Our res u lts ar e in agreement with those presented by 
llovino etail ((2010), who studied the evolution of ft, in groups 
and in the field within the zCO SMOS-lOk sample (see also, 
for completeness, the analysis of Kov ac et aD 120 09 concerning 
the fraction of early type galaxies in groups). This agreement is 
based on the comparison with their Sample II, the one with a lu- 
minosity cut most similar to ours, a nd it holds for both the group 
and the total sample. Interestingly. llovino et alj d2010t) find that 
in their luminosity limited sample galaxy colour still depends on 
environment at z ~ 1 (with a trend similar to what we find in this 
work), but at the same redshift they do not converge to the same 
conclusion when the blue fraction is recovered from a (stellar) 
mass limited sample (log(M/MO) > 10.8). They explain this re- 
sult suggesting that red galaxies of such stellar masses, already 
in place at z ~ 1, may rise from internal mechanisms of evolu- 
ti on, on which envir onment has no influence. We refer the reader 
to llovino et alj d2010l) fo r more details. 

ICucciati et al.l ([2006) carried on a similar analysis using the 
same VVDS-02h data set that we use in this work. They studied 
the colour-density relation up to z = 1.5, with the local den- 
sity computed within Gaussian filters with cr — 5 fr'Mpc. They 
found that the colour-density relation becomes weaker for in- 
creasing redshift (the evolution of ft, being faster in high den- 
sities), and that at z ~ 1 no significant colour-density relation 
is detected, for galaxies with Mb < -20 (that is equivalent to 
the threshold we use in this work). Taken at face value, our 
results are not compatible with these previous findings, as we 
find that at z ~ 1 is still different in groups and in the to- 
tal sample. This difference can be explained with the fact that 
we are exploring higher densities/smaller scales (< 1 /r'Mpc, 
see Section [6]). For example, there are several studies in liter- 
ature suggesting that environmental effects on large scales are 
only a weaker residual of the ones acting on smaller scales (e.g. 
iKauffmann et alj|2004]: [B lanton et al. 2006). The same hypoth- 
esis is suggested by ICooper et alJ d2007l) . when comparing the 
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colour-density r elation found in the D EEP2 data set with the 
one presented in Cucciati et al. (2006). They still find a colour- 
density re lation at z ~ 1 , but on smaller scales than those inves- 
tigated bv lCucciati et all d2006l) . 

A direct comparison of results obtained on the basis of het- 
erogeneous definitions of the local environment (as for example 
densit y field maxima as o pposed to groups) is not straightfor- 
ward. ICooper et al] (120071) showed, for example, that the evo- 
lution of the colour-de nsity relation is co ntinuous in the range 
0.4 < z < 1.3, while Gerk e et all (120071) . who used the same 
DEEP2 data set but a different definition of environment based 
on groups, found that the evolution of //, in groups is flat in 
the range 0.7 < z < 1.0, and it steepens for 1.0 < z < 1.3. 
Nevertheless, the two works agree on the fact that at z ~ 1.3 
the colour-density relation seems to disappear. No need to em- 
phasize that the physics associated with different environmental 
definition has still to be fully understood. 

Also a direct comparison of our results with those presented 
in other works up to z ~ 1 is not trivial. Interpretation is ham- 
pered by the the non-homogeneity of group catalogues selected 
according to different selection criteria. As a consequence, the 
picture emerging from these studies is complex and sometimes 
even contradictory. As this Section is meant to give a general 
idea of the kind of studies that can be potentially carried out with 
our group sample, we wil l not en ter i n details. We rather ref er the 
reader to Pog gianti et alJ d2006l) and lAndreon et alJ d2006h for a 
more in-depth discussion about the status of the art and the prob- 
lems related to uncontrolled selection effects. 

As a second step, we examined the behavior of f, in groups 
characterized by different degrees of richness. There is still no 
agreement in literature about the dependence of fb on cluster 
properties. For example, fi, is found both to depe nd on cluster 
richness dMargoniner et alJEOOlt iGoto et al.ll2003l) . or to be in- 
depen dent of it, as well as of l ine of sight velocity dispersion an d 
mass dDe Propris et al.ll2004t lGotoll2005L iPopesso et al1l2007bl) . 
To address this issue, we associated to each group the number of 
members found by the algorithm. Nevertheless, due to the survey 
characteristic (sampling rate, spectra signal to noise ratio, etc.), 
the observed number has to be corrected to recover the real num- 
ber of members within the flux limit of the survey (Iab ^ 24). 
We did this by weighting each galaxies with the 'target sam- 
pling rate' and th e 'spectroscopic success rate' of the survey (see 
Ilbert et"aT1l2005h . We then modulated this mean weight with a 
finer correction which takes into account that the sampling rate 
is not uniform in the field. This is done by using the ^(a, S) se- 
lection function described in lCucciati et alJ d2006l) . This way, for 
each group we computed a corrected richness (N c ), which is the 
sum of the weights of those galaxies brighter than the evolving 
luminosity limit described above. In Figure[14] green diamonds, 
orange squares and red stars show fb within groups with N c > 7, 
14 and 20. We can see in the plot a general trend of decreasing 
fb for increasing N c , at any redshift explored. Given the error 
bars, this decrement is not significant when considering single 
steps in N c , but the overall tendency is clear. Nevertheless, it 
does not seem that the fb redshift evolution is different for dif- 
ferent va lues of N r . These r esults are in agreement with those 
found bv llovino et all d2010l) in their analysis of the zCOSMOS- 
10k group sample, but we extended them to fainter magnitudes. 
It would be indeed interesting to study the dependence on N c 
also for other galaxy properties, as it has been done for example 
for star formation rate ( SFR) and specific SFR (see for exam- 
ple lPopesso et al.ll20 07b). This could give hints on how different 
galaxies properties may be affected by different environments. 
We defer this study to a future work. 
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Fig. 14. Fraction of blue galaxies ( U - B < 1) for group galaxies 
(blue empty triangles) in three different redshift bins: 0.2 < z < 
0.5, 0.5 < z < 0.7 and 0.7 < z < 1.0. The linear fit of this three 
points is overplotted as a blue line, while the upper black line is 
the linear fit for fb computed within the 'total' sample, he dashed 
areas along the two linear fits show the locus where the linear fits 
could lie considering their 1-cr error on both intercept and slope. 
See text for more details. Other symbols are for group galaxies 
in groups with increasing corrected richness: Ncorr > 7, 14,20 
for green diamonds, orange squares and red stars respectively. 

8. Summary and conclusions 

We have compiled a homogeneous catalogue of optical groups 
identified in the VVDS-02 field by means of the VDM algo- 
rithm, in the range 0.2 < z < 1 .0. 

We used mock catalogues simulating the VVDS survey to 
optimize the performances of the group-finding algorithm (max- 
imizing the completeness and the purity of the resulting group 
catalogue) as well as to minimize possible selection effects. Our 
main results are here summarized. 

- Using the mock catalogues, we verified that the VVDS-02h 
survey sampling rate allows us to recover at least 50% of 
the groups (with a virial line of sight velocity dispersion 
cr Wr > 350 km/s) that are potentially present in the parent 
photometric catalogue up to z — 1 . 

- We tested how well <r,,„. of the halo mass particles can be 
estimated using sparsely sampled galaxy velocities. We ver- 
ified that with this method, given the characteristics of our 
survey (flux limit, sampling rate, redshift measurement error) 
we are able to recover a sensible value of cr yi > for cr V! > > 350 
km/s. 

- Applying the optimized algorithm to the VVDS real data set, 
we obtained a catalogue of 318 groups of galaxies (with at 
least two members) in the range 0.2 < z < 1 .0. Among these 
groups, 63 have a measured line of sight velocity dispersion 
greater than 350 km/s. The group catalogue is characterized 
by an overall completeness of ~ 60% and a purity of ~ 50%. 
Nearly 19% of the total population of galaxies live in these 
systems. 

- the number density distribution as a function of both redshift 
(n(z)) and velocity dispersion («(<x)) of the VVDS groups 
with <x > 350 km/s scales in qualitative agreement with the 
analogous statistics recovered from the mock catalogues. 

- We studied the fraction f, of blue galaxies (U-B < l)in the 
range 0.2 < z < 1. We used a luminosity-limited subsample 
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of galaxies extracted from our data (Mb < -18.9 - l.lz), 
complete up to z = 1 . We found that ft, is significantly lower 
in groups than in the global galaxy population. Moreover, ft, 
increases as a function of redshift irrespectively of the en- 
vironment, with marginal evidence for a faster growth rate 
in groups. We also analysed how ft, varies as a function of 
group richness, finding that, at any redshift explored, fi, de- 
creases in systems with increasing richness. 

Further explorations of the properties of VVDS groups is 
left to future works. We only anticipate that the high degree 
of completeness of the catalogue can be potentially exploited 
for extracting cosmological information via, for example, clus- 
ter counts techniques. The high level of purity makes the VVDS 
group sample ideal also for astrophysical studies which aim at 
tracing various physical properties of galaxies as a function of 
local density and environment. We also mention that the cross- 
correlation studies of our optically-selected catalogue with sam- 
ples inferred in the same field with independent techniques will 
help to gain insights not only on cluster selection biases but also 
on the physics at work within these extreme environments. 
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